首页> 外文OA文献 >Deep neural networks for i-vector language identification of short utterances in cars
【2h】

Deep neural networks for i-vector language identification of short utterances in cars

机译:用于i-vector语言识别汽车短话语的深度神经网络

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, and Finnish. As the response time of the LID system is crucial for user acceptance in this particular task, speech signals of different durations with total average of 3.8s are analyzed. In this paper, the authors propose the use of Deep Neural Networks (DNN) to model effectively the i-vector space of languages. Both raw i-vectors and session variability compensated i-vectors are evaluated as input vectors to DNNs. The performance of the proposed DNN architecture is compared with both conventional GMM-UBM and i-vector/LDA systems considering the effect of durations of signals. It is shown that the signals with durations between 2 and 3s meet the requirements of this\udapplication, i.e., high accuracy and fast decision, in which the proposed DNN architecture outperforms GMM-UBM and i-vector/LDA systems by 37% and 28%, respectively.
机译:本文重点介绍语言识别(LID)技术在智能汽车中的应用。我们以四种语言处理英语中的简短句子或单词,这些语言是英语,西班牙语,德语和芬兰语。由于LID系统的响应时间对于用户在此特定任务中的接受度至关重要,因此分析了不同持续时间的语音信号,其平均时间为3.8s。在本文中,作者提出使用深度神经网络(DNN)有效地建模语言的i向量空间。原始i向量和经过会话可变性补偿的i向量都被评估为DNN的输入向量。考虑到信号持续时间的影响,将提出的DNN体系结构的性能与常规GMM-UBM和i-vector / LDA系统进行了比较。结果表明,持续时间在2到3s之间的信号满足该\ ud应用的要求,即高精度和快速决策,其中所提出的DNN架构在性能上优于GMM-UBM和i-vector / LDA系统37%和28 %, 分别。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号